Don't Be Greedy: Leveraging Community Structure to Find High Quality Seed Sets for Influence Maximization
نویسندگان
چکیده
We consider the problem of maximizing the spread of influence in a social network by choosing a fixed number of initial seeds — a central problem in the study of network cascades. The majority of existing work on this problem, formally referred to as the influence maximization problem, is designed for submodular cascades. Despite the empirical evidence that many cascades are non-submodular, little work has been done focusing on non-submodular influence maximization. We propose a new heuristic for solving the influence maximization problem and show via simulations on real-world and synthetic networks that our algorithm outputs more influential seed sets than the state-of-the-art greedy algorithm in many natural cases, with average improvements of 7% for submodular cascades, and 55% for non-submodular cascades. Our heuristic uses a dynamic programming approach on a hierarchical decomposition of the social network to leverage the relation between the spread of cascades and the community structure of social networks. We verify the importance of network structure by showing the quality of the hierarchical decomposition impacts the quality of seed set output by our algorithm. We also present “worst-case” theoretical results proving that in certain settings our algorithm outputs seed sets that are a factor of Θ( √ n) more influential than those of the greedy algorithm, where n is the number of nodes in the network. Finally, we generalize our algorithm to a message passing version that can be used to find seed sets that have at least as much influence as the dynamic programming algorithms.
منابع مشابه
Influence Maximization Algorithm Using Markov Clustering
Social Network Services are known as a effective marketing platform in that the customers trust the advertisement provided by their friends and neighbors. Viral Marketing is a marketing technique that uses the pre-constructed social networks to perform maketing with small cost while maximizing the spread. Therefore, which seed user to select is the primary concern in viral marketing. Influence ...
متن کاملTime is What Prevents Everything from Happening at Once: Propagation Time-conscious Influence Maximization
The influence maximization (IM) problem as defined in the seminal paper by Kempe et al. has received widespread attention from various research communities, leading to the design of a wide variety of solutions. Unfortunately, this classical IM problem ignores the fact that time taken for influence propagation to reach the largest scope can be significant in realworld social networks, during whi...
متن کاملRevisiting Non-Progressive Influence Models: Scalable Influence Maximization in Social Networks
While influence maximization in social networks has been studied extensively in computer science community for the last decade the focus has been on the progressive influence models, such as independent cascade (IC) and Linear threshold (LT) models, which cannot capture the reversibility of choices. In this paper, we present the Heat Conduction (HC) model which is a non-progressive influence mo...
متن کاملDistance-Based Influence in Networks: Computation and Maximization
A premise at a heart of network analysis is that entities in a network derive utilities from their connections. The influence of a seed set S of nodes is defined as the sum over nodes j of the utility of S to j. Distance-based utility, which is a decreasing function of the distance from S to j, was explored in several successful research threads from social network analysis and economics: Netwo...
متن کاملRevisiting Non-Progressive Influence Models: Scalable Influence Maximization
Influence maximization in social networks has been studied extensively in computer science community for the last decade. However, almost all of the efforts have been focused on the progressive influence models, such as independent cascade (IC) and Linear threshold (LT) models, which cannot capture the reversibility of choices. In this paper, we present the Heat Conduction (HC) model which is a...
متن کامل